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Abstract. We develop a model characterizing all possible knots and links arising from recombination 
f*"^ \ starting with a twist knot substrate, extending previous work of Buck and Flapan. We show that all 

knot or link products fall into three well-understood families of knots and links, and prove that given a 
positive integer n, the number of product knots and links with minimal crossing number equal to n grows 
I^H ' proportionally to n^ . In the (common) case of twist knot substrates whose products have minimal crossing 

r ^ ^ number one more than the substrate, we prove that the types of products are tightly prescribed. Finally, 

• , we give two simple examples to illustrate how this model can help determine previously uncharacterized 

yS^ • experimental data. 

ri . 1. Introduction 

The central axis of the famous DNA double helix can become knotted or linked as a result of numerous 
^Nf^ , biochemical processes, most notably site-specific recombination [T]-[3]. A wide variety of DNA knots and 

^ ■ links have been observed [fflTB], Characterising the precise knot or link type can often help understand 

V^ I structural or mechanistic features of the biochemical reaction p!6H27] . 
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Experimentally, such a characterization is typically achieved via gel electrophoresis (which stratify DNA 
products according to their minimal crossing number) [28] and electron microscopy (which allows us to 
visualize the over- and under-crossings of the DNA molecule) [29l[30] together with knot invariants such as 



^-v ■ the Jones polynomial (amongst many others) [31|. However, electron microscopy is not straightforward and 

often the precise over- or under-crossing cannot be categorically determined. Partial information can be 
gleaned by using gel electrophoresis but as there are 1,701,936 prime knots with minimal crossing number 
less than 17 this information is not sufficient |32J. Furthermore, gel electrophoresis does not distinguish 
between handedness of chiral knots, so this does not give the full picture. Thus topological techniques, such 
as those presented here, can aid experimentalists in characterizing DNA knotted and linked molecules by 
restricting the types of knots or links that can arise in a particular context. 

Here we focus on the most common biochemical reaction that yields DNA knots and links: site-specific 
recombination. Site-specific recombination is an important cellular reaction that has been studied extens- 
ively since the 1960s. It involves a reciprocal exchange between defined DNA segments. Biologically, this 
results in a variety of processes (see [33 and references therein). Apart from their fundamental functions 
in the cell, site-specific recombinases give scientists an elegant, precise and efficient way to insert, delete, 
and invert segments. Thus they are rapidly becoming of pharmaceutical and agricultural interest as well as 
being used in the development of biotechnological tools [3ll[35] . 

Twist knots are one of the most common DNA conformations. This is not surprising as in the cell most DNA 
is (plectonemically) supercoiled (like an over-used phone cord) and in the lab most experiments done with 
site-specific recombinases use small (plectonemically) supercoiled circular DNA molecules, so an unknot can 
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be transformed to a twist knot by a single crossing change (see Figure [T]). Unlike (2,n)-torus knots, twist 
knots occur as knots (not links) for both odd and even minimal crossing number, MCN(i^). Thus ubiquitous 
DNA twist knots arise as a result of a variety of site-specific recombination reactions [3)[5HTT] . 

Despite the biological importance of this twist knot family, there has yet to be a systematic model in- 
corporating these as substrates for a generic site-specific recombinase. (Earlier predictions of knots arising 
from site-specific recombination did not consider twist knots J36H38] ). Here we rectify this by presenting a 
model, extending the work of [37], classifying all possible knots and links that can arise from site-specific 
recombination on a twist knot. 

Our model is built on three assumptions for which biological evidence is provided in [38l[39]. We con- 
struct a model that predicts all possible knots and links that can arise as products of a single round of 
recombination, multiple rounds of (processive) recombination, and of distributive recombination, given a 
twist knot substrate C{2^v) and our three assumptions. We predict that products arising from site-specific 
recombination on a twist knot substrate C{2^v) must be members of the three families of products illus- 
trated in Figure [2l Members of these families of knots and links include prime and composite knots and 
links with up to three components (see Section 2.3). Our model can also distinguish between the chirality 
of the product molecules of site-specific recombination (see Section 5). Our model is independent of site 
orientation, and we make no assumption on the number of base pairs of the molecule(s). 

1.1. Structure of our paper. This paper is organised as follows: in Section [2] we give a concise intro- 
duction to site-specific recombination and introduce notation. In Section [3] we state and explain the three 
assumptions about the recombinase complex, the substrate and the mechanisms of recombination. (Biological 
justifications for these assumptions can be found in [381 139]). In Section |4] we determine the pre-recombinant 
and post-recombinant conformations of the recombination sites and all possible conformations of the DNA- 
protein complex; we also prove the necessary background lemmas for Section [5] In Section [5] we prove 
Theorems 1 and 2 which determine all the putative DNA knot and link products of (non-distributive) site- 
specific recombination on a twist knot substrate. We show that these products belong to one of the three 
families of knots and links illustrated in Figure [21 (These families of knots and links are defined in section 
2.3). We also identify all knots and links that arise as products of distributive site-specific recombination. In 
Section [6l we prove Theorem 4 which shows that all the possible DNA knot and link products of site-specific 
recombination on a twist knot substrate are a very small fraction of all knots and links. We also further 
restrict the knot and link types of products that have minimal crossing number one more that of the sub- 
strate. Finally, in Section [T] we consider two simple uses of our model. For a detailed biological discussion 
of applications of our model, and how to use this model as a tool in a variety of site-specific recombination 
systems, we refer the reader to [39] . 

2. Biological systems and terminology 

In this section we give a concise introduction to site-specific recombination, introduce notation, and describe 
the families of knots and links that arise as products of site-specific recombination on twist knot substrates. 

2.1. Site-specific recombination. Site-specific recombination reshuffles DNA sequences by inserting, de- 
leting or inverting DNA segments of arbitrary length. As such, it mediates a variety of important cellular 
processes including chromosome segregation and viral infections. (See the review [33^ for more details). Min- 
imally, site-specific recombination requires both particular proteins {site- specific recombinases) and two short 
(30-50bp) DNA segments (the crossover sites) within one or two DNA molecules {the substrate). (More com- 
plex site-specific recombination systems may also require additional proteins (called accessory proteins) and 
DNA sites (called enhancer sequences).) Site-specific recombinases can be broadly divided into two subfamil- 
ies: serine site-specific recombinases and tyrosine site-specific recombinases, based on their catalytic residues. 

Site-specific recombination roughly has three stages (see Figure [3]). First, two recombinase molecules bind 
to each of two crossover sites and bring them close together. (The sites together with the four bound re- 
combinases is called the synaptic complex.) Second, the crossover sites are cleaved, exchanged and resealed. 
(The precise nature of this intermediary step is determined by the recombinase subfamily, see Assumption 



3 and Figures Bal and l4bl ) . And finally, the rearranged DNA {the product) is released. 

Multiple rounds of strand exchange can occur before releasing the DNA: this process is known as pro- 
cessive recombination. (See Assumption 3 and Figure O) This is in contrast to distributive recombination^ 
where multiple rounds of the entire process of recombination (including releasing and rebinding) occurs. 
Only serine recombinases can mediate processive site-specific recombination, but both types of recombinases 
can mediate distributive recombination. In this work we use the term substrate to refer specifically to the 
DNA prior to the first cleavage. We treat processive recombination as one extended process (with several 
intermediate exiting points for the reaction). 

2.2. Mathematical terminology. A twist knot is a knot that admits a projection with a row of '^ 7^ 
vertical crossings and a hook, as in Figure [6b] and is denoted by C(±2, v). If r = —2, by flipping the top loop 
we get r = +2 and add a positive crossing to the row of v crossings (see this isotopy illustrated in Figure 
[6c|) . Thus from now on we assume that our substrate is the twist knot (7(2, 'u), v ^ 0. (See [4QH44] for a 
detailed discussion on twist knots). 

Note: Twist knots can be generalized to clasp knots. A clasp knot C(r, v) is a knot that has two non- 
adjacent rows of crossings, one with r 7^ 0, ±1 crossings and the other with v ^ crossings (Figure [6a|). (By 
adjacent rows of r and v crossings we mean that the two rows cannot be considered as a single row of r -\- v 
crossings as they can in the case of the torus knots and links T(2, r -\- v). A clasp knot C(r, v) with r = ±2 
is a twist knot.) 

We use the following terminology and notation. We consider the central axis of the DNA double helix 
and therefore, when we illustrate DNA molecules we draw this axis (and not the two DNA backbones that 
make up the double helix). Let J denote the twist knot substrate molecule. Once the synaptic complex has 
been formed, the recombinase complex, B, denotes the convex hull of the four bound recombinase molecules 
together with the two crossover sites. (Note that 5 is a topological ball). The recombinase-DNA complex, 
J yj B, denotes the union of the substrate J with the recombinase complex B. Let C = cl(R^ — B) and let 
C n J denote the complement of the recombinase complex. If the recombinase complex meets the substrate 
in precisely the two crossover sites then we say the recombinase complex is a productive synapse, see Figure 
H In particular, for recombinases that utilize an enhancer sequence or accessory proteins, the recombinase 
complex is a productive synapse if the accessory sites and proteins are sequestered from the crossover sites 

2.3. Notation for families of knots and links that arise from site-specific recombination on a 
twist knot. We now discuss the three families of knots and links that we encounter in the main results of this 
paper. The families of knots and links illustrated in Figures [2a] [2bl and [2cl are referred to as F{p, q, r, s, t, u), 
Gl and G2 respectively. 

We note that F{p, q, r, s, t, u) is a special family of knots and links. In [45l it is shown that a standard 
rational tangle diagram corresponds to any expansion of a rational number - as a continued fraction; in 
this paper we choose the convention that a choice of the expansion in which all terms have alternating 
sign gives an alternating diagram (see Figure [6d] for a convention on crossings and Figure [Hb for an ex- 
ample of a rational tangle diagram). A Montesinos link is a link L that admits a diagram D composed of 
771 > 3 rational tangle diagrams Ri,...,Rm and /c > half twists glued together as in Figure [8^ (and see 
e.g. [50]). Members of F{p,q,r,s,t,u) are obtained by the numerator closure of Montesinos tangles of the 
form (^^;^, :p^^^ ^^^fi)' That is, for three standard rational tangle diagrams with fractions j:^^^ :p^^ and 
^^, take their partial sum as in Figure [8h and then the closure of the diagram as in Figure [Hli. Denote the 
tangle with corresponding rational number ^^A^, ;:^t:i ^^^ ^+1 ^^ ^17^2 and Rs respectively. As in [37], 
we define the family of small Montesinos knots and links to be the family of Montesinos links for i = 3, Ri as 
above and k = 0. Thus, our family of knots and links F{p,q,r,s,t,u) illustrated in Figure {2a\ is a subfamily 



Note that if S is a productive synapse then it can be thought of as a (2-string) tangle. However, unhke in the traditional 
tangle model, the complement of B may take a variety of forms (not necessarily that of a tangle), so we avoid this potentially 
confusing terminology. 
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of small Montesinos knots and links. 

In the family F{p^q^r^s^t^u) of knots and links, the variables p^q^r^s^t^u describe the number of crossings 
between two strands in that particular row of crossings. Note that knots that are members of this family 
can be prime or composite and links belonging to this family can have up to three components. In this 
family, the variables p, g, r, s, t, ii can be positive, negative or zero. By letting the variables equal or ±1 as 
appropriate, we obtain the subfamilies illustrated in Figure [9l Subfamily 1 is denoted by F^^ (0, g, r, 5, t, i^) 
with \r\ > 0, |t| > 1. Subfamily 2 is denoted ^5-2(^1,^,^,5,^1,1^) with \r\ > 1. Subfamily 3 is denoted 
^5-3(^1, g, r, s, t, u) with |r|, \t\ > 1. Subfamily 4 is denoted Fs^{p^ q^r^ s^t^ u) when we forbid p,t, r = {0, ±1}. 
Subfamily 5 are composite knots or links T(2, ii)tlC(p, g) formed from a torus knot and a twist knot. Sub- 
family 6 is a subfamily of F(p, g, r, 5, t, i^) with p + g = 0. Subfamily 7 is a family of clasp knots and links, 
C(r, s). (Recall that it is a generalization of the family of twist knots, which we consider in this paper as 
the substrate molecule for site-specific recombination.) Subfamily 8 is the family of torus knots and links, 
T(2,r). Finally, subfamily 9 is the family of pretzel knots K{p^s^u). (Note that some of the subfamilies 
in Figure [9] are special cases of other subfamilies, for example in Subfamily 2, if we let q = then we get 
Subfamily 5. Similarly for Subfamilies 1 and 7, 3 and 6.) 

In the families Gi(k) and G2(k) of knots and links, the variable k describes the number of crossings between 
the two strands. Depending on the value of /c, we obtain either a knot or a link: For Giik) for z = 1,2, if 
k is odd, the members of these families are knots. If k is even, then the members of this family are two 
component links. These families are illustrated in Figures [2bl and [2cl 

Note that there are a few knots and links that belong to both F(p, g, r, s, t, u) and either Gi{k) or G2{k). For 
example the trefoil knot has a projection as a member of F(p, q, r, 5, t, u) with p = O^t^u = l,r = 2,5 = —1, 
and a projection as a member of G2{k) with k = 2. 



3. Assumptions 

In this section we state and explain the three assumptions about the recombinase complex, the substrate and 
the mechanisms of recombination. (Biological justifications for these assumptions can be found in J381 I39]). 

We make the following three assumptions about the recombinase-DNA complex, which we state in both 
biological and mathematical terms. These assumptions are similar in [371[38]. However, for Assumption 2 
in particular, we introduce new terminology and prove a necessary result in order to re-state this Biological 
Assumption in precise mathematical terms. In [38l[39] we provide experimental evidence showing that each 
of these assumptions is biologically reasonable. 

Biological Assumption 1. The synaptic complex is a productive synapse, and there is a projection of the 
crossover sites which has at most one crossing between the sites and no crossings within a single site. 

Mathematical Assumption 1, BnJ consists of two arcs and there is a projection of B f] J which has at 
most one crossing between the two arcs, and no crossings within a single arc. 

Fix a projection of J such that B H J has one of the forms illustrated in Figure [TOl Observe that form Bl 
can be rotated by 90° to obtain form B2. However, we list form Bl and B2 as two different forms to make 
subsequent figures easier to follow (similarly for forms B3 and B4). 

Note that hooked productive synapses, illustrated in Figure \Qd\ are biologically possible because there exist 
many recombinases whose productive synapse is not characterized, and for these systems BnJ could be 
hooked. However, this does not contradict Assumption 1, since a hook has no projections with no crossings, 
but has projections where there is only one crossing. There is an isotopy of the substrate molecule taking a 
hook from a projection with two crossings to a projection with one crossing, without affecting the projection 
of the substrate molecule outside a neighbourhood of the hook (Figure [TTI illustrates this). 



Biological Assumption 2. The synaptic complex does not pierce through a supercoil or a branch point in 
a nontrivial way and the supercoiled segments are closely juxtaposed. Also, no persistent knots or links are 
trapped in the branches of the DNA on the outside of the synaptic complex. 

Here persistent knots or links are those that remain after a continuous deformation of the DNA molecule, 

keeping B fixed. 

Before we can state Assumption 2 mathematically, we need to introduce some terminology. 

We define a planar surface with twists as in [37]. Consider a surface lying in a plane together with a 
finite number of arcs in the surface whose endpoints are on the boundary of the surface (see Figure [T2fa)). 
We can use this planar surface with arcs to obtain a non-planar surface by replacing a neighborhood of each 
arc in the original surface by a half-twisted band and removing the top and bottom ends of the band (see 
Figure [121 6)). Figure [T2l illustrates how such a surface can be obtained from a doubly-punctured planar 
disc together with a collection of arcs defining the twists. A planar surface with twists is defined to be any 
surface which can be obtained from a planar surface in this way. 

Define a surface D with boundary J to be a spanning surface for J \i D \s topologically equivalent to 
a doubly-punctured planar disc with twists when J is a twist knot (Figure [T2]). (We can think of a spanning 
surface for J as a soap film surface with boundary J.) In the construction of this spanning surface, in Figure 
[T2fa), we choose the twisted band that replaces the arc connecting the boundary of the planar disc and the 
right-most puncture and the twisted band that replaces the arc connecting the two punctures, such that the 
corresponding crossings defined on the non-planar surface with twists make a set of +2 horizontal crossings 
that we call a clasp^ illustrated in Figures [T2l and [T3l 

Figure [14] shows examples of the relationship between the spanning surface D and the recombinase complex. 
Observe that in illustrations (i) and (ii) D fl dB consists exactly two arcs. In illustration (iii), no matter 
how the spanning surface D is chosen, D fl dB contains at least one circle as well as two arcs, whereas in 
illustration (iv) there is an isotopy that removes the circle in D fl dB. Mathematically, a spanning surface 
D is pierced non-trivially by B if and only if D f] dB contains at least one circle in addition to the required 
two arcs, and there is no ambient isotopy of D that removes this additional circle. 

Claim. The intersection of any spanning surface for J and dB contains exactly two arcs. 

Proof. By Assumption 1, B contains exactly two arcs of J = dD, thus dB fl J is precisely four points. 
It follows that the intersection of any spanning surface for J with dB contains exactly two arcs, whose 
endpoints are the four points dB fl J. By Biological Assumption 2, B does not pierce the interior of any 
spanning surface I^ in a non-trivial way. Thus D D dB consists of exactly two arcs and no circles that cannot 
be removed by an ambient isotopy of D. D 

Suppose that I) is a spanning surface for J. We know by Biological Assumption 2 that the supercoiled seg- 
ments of the DNA molecule J are closely juxtaposed, this means that we can visualize the spanning surface 
D as a narrow soap film surface. In particular, this means that the two arcs in D D dB are each very short, 
so we can assume that they are co-planar. (Note that this does not mean that the crossover sites themselves 
{dD n B) are co-planar). 

We can now define a surface D D C to he unknotted relative to dB if there is an ambient isotopy of C, 
point-wise fixing dB^ that takes D DC to di doubly-punctured planar disc with twists, where the end points 
of the arcs defining the twists are disjoint from dB. Illustration (ii) of Figure [HI shows an example of DnC 
unknotted relative to dB. Illustration (i) shows a knot trapped in the substrate molecule outside of B. 

We are now ready to state Assumption 2 mathematically. 

Mathematical Assumption 2. J has a spanning surface D such that DDdB consists of exactly two arcs, 
the two arcs are co-planar and D nC is unknotted relative to dB. 

5 



The fact that J has a spanning surface D satisfying Assumption 2 means our model is independent of the 
projection of the substrate J, so we now fix a projection of J as in Figure [6b] and from now on we work 
with this particular projection J. Note that here we are referring specifically to the substrate J before the 
synaptic complex is formed. The conformations of the pre-recombinant recombinase-DNA complex are dealt 
with in Section 111 

Recall site-specific recombinases fall into two subfamilies, the serine recombinases and the tyrosine recom- 
binases. The details of the mechanism differ depending on which subfamily the recombinase belongs to. 
Assumption 3 addresses the mechanism for each subfamily of recombinases. 

Biological Assumption 3 for serine recombinases. Serine recombinases perform recombination via the 
subunit exchange mechanism. This mechanism involves making two simultaneous double- stranded breaks in 
the sites, rotating two recombinase molecules in opposite sites by 180° within the productive synapse and 
resealing the new DNA partners (Figure [Tap. In each subsequent round of processive recombination, the 
same set of subunits is exchanged and the sense of rotation remains constant. 

Biological Assumption 3 for tyrosine recombinases. After recombination mediated by a tyrosine re- 
combinase, there is a projection of the crossover sites which has at most one crossing (Figure^J^. 

The mathematical statement is as follows: 

Mathematical Assumption 3 for serine recombinases. After (each round of processive) recombination 
mediated by a serine recombinase, there is precisely one additional crossing between the crossover sites, (see 
Figure{^. 

Mathematical Assumption 3 for tyrosine recombinases. After recombination mediated by a tyrosine 
recombinase, there is a projection of the crossover sites which has at most one crossing (Figure[TQ). 

4. Possible forms of the productive synapse and its complement 

In this section we determine the pre-recombinant and post-recombinant conformations of the recombination 
sites and all possible conformations of the DNA-protein complex. We also prove the necessary background 
lemmas for Section [5l 

4.1. Possible forms of the productive synapse B H J. As a result of Assumption 2, we have fixed a 
projection of J prior to cleavage such that B D J has form 51, 52, B3 or 54, illustrated in Figure [TOl It 
follows from Assumption 3 that after n recombination events with serine recombinases, we have added a row 
of either n — l,norn + l identical crossings that can be positive, negative or zero. Without loss of generality, 
we assume that after n recombination events with serine recombinases, we add a row of n identical crossings 
that can be positive, negative or zero. Thus after n recombination events our fixed projection of 5 fl J is 
isotopic to one of the forms nl or n2 illustrated in Figure [5l (Note that from nl we can obtain n2 by rotating 
by 90°. However, we list them as separate forms in order to make it easier to follow the use of Figure [15] in 
the proof of Theorem 2.) 

For tyrosine recombinases, without loss of generality we assume that the post-recombinant projection of 
B n J has one of the eight forms in Figure [TH Notice that conformations B5, B6,B7 and 58 are hooks. 
Hooks have no projections with no crossings but do have projections with one crossing, so we allow these 
conformations. Forms 51,53,55 and 57 are equivalent by a 90° rotation, to forms 52,54,56 and 58 
respectively. (We list them separately to make it easier to follow the use of Figure [iTl in the proof of Theorem 

4.2. Possible forms of the complement of the productive synapse C D J. In this section we determine 
all the possible conformations of Cfl J, and determine the respective pre-recombinant conformations of 5n J 
for each form of C fl J. For simplicity, we will use the phrase ^C H J has a particular form' when we mean 
that 'C n J is ambient isotopic, pointwise fixing 95, to that form'. The forms of C fl J referred to in the 
lemma are illustrated in Figure [TSl 
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Lemma 1. Suppose that Assumptions 1, 2 and 3 hold for a particular recombinase-DNA complex with 
substrate J. Let J be a twist knot C{2^v). Then C D J has one of possible five forms listed below. For each 
of these, B has corresponding forms: 
If C n J has the form 

• CI, then B n J has the form Bl 

• C2, then B n J has the forms Bl, B3 or B4 

• C3, then B n J has the forms B2, B3 or B4 

• CA, then B f] J has the form Bl 

• C5, then B f] J has the forms B3 or BA. 

Proof. By Assumption 2, we can choose a spanning surface I^ to be a doubly-punctured planar disc with 
twists as in Figure [121 such that D fl dB is two co-planar arcs and D {^C is unknotted rel dB. 

Consider the doubly-punctured disc that generates D in Figure [T2lf a). We can consider this punctured 
planar disc with arcs as a thrice-punctured S'^ with a collection of arcs connecting the punctures (the three 
punctures are numbered 1, 2 and 3 in Figure [T2lf a)). A thrice-punctured S'^ in S^ with arcs connecting the 
three punctures can be regarded as a graph with three points and a collection of arcs connecting them, as 
illustrated in Figure [T9l 

We determine all possible conformations of C fl J in three steps as follows: 

First we consider all different possible locations of the specific sites D H dB on the thrice-punctured S'^ . 
The two specific sites can be located either both on the boundary of one puncture of S'^ or on a combination 
of these, so we consider each case. Notice, however, that from the symmetry of the graph described above 
it is enough to consider only the cases where the sites are located either: 

• Case (11): both on the boundary of puncture 1, 

• Case (22) or (33): both on the boundary of puncture 2 (equivalent to both on puncture 3), 

• Case (12) or (13): one site on the boundary of puncture 1 and the other on the boundary of puncture 
2 (equivalent to one site on puncture 1 and the other on puncture 3) and 

• Case (23): one site on the boundary of puncture 2 and the other on puncture 3. 

Next, on the corresponding spanning surface D (generated by the thrice-punctured S'^) we consider all pos- 
sibilities for D n B, which can either be two discs or a (possibly twisted) band. 

Finally, we perform an appropriate isotopy of this spanning surface which maps it to a spanning surface 
having one of the conformations of C fl J illustrated in Figure [T8l as boundary. We do this for every case. 

In Figures [20l [21] and [22] we illustrate the isotopy of C fl J to one of the standard forms CI, C2, C3, C4 
or C5 (or we show that such a case is not allowed by assumption) for each case. Even though cases (22) 
and (33) (and (12) and (13)) are equivalent, we illustrate all of them since it may be more straightforward 
to visualise the isotopy in one case or the other. 

In Figures [20l [21] and [221 inside each box we have three sets of illustrations: 

Left illustration: thrice punctured S'^ with (thin, long) arcs which define the twists on a non-planar 
surface and (thick, short) arcs on the boundary of one of the punctures (or a combination of these punctures) 
defining the arcs D fl dB. 

Middle illustration: the spanning surface D of our substrate J = C{2,v) with the arcs D fl dB illus- 
trated by a pair of thick, short arcs. 

Right illustration: corresponding conformation of C fl J. 

Since there are many cases and some of whose isotopies are not very complicated, we have illustrated 
all the cases in Figures \20\ [2T] and [22l Here we describe two of the most involved in detail. 

Case (lie): Assume both arcs of DOdB lie on the boundary of puncture 1 of the thrice-punctured S'^ C S^. 
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Consider case (lie) as illustrated in Figure [2Ql The thrice-punctured S'^ generates the spanning surface D 
illustrated. Figure [23l illustrates a continuous deformation taking this conformation of Z) to a conformation 
whose boundary is of the form CI. 

Case (12a): In Figure [22] consider case (12a). The thrice-punctured S'^ generates the spanning surface 
D illustrated. Figure [23] illustrates a continuous deformation taking this conformation of I) to a conforma- 
tion whose boundary is of the form CI. 

Here C ^ D \s unknotted rel dB so the left and middle forms illustrated in Figures [20l [21] and [22] yield 
(up to isotopy, fixing dB) the corresponding forms of C fl J illustrated on the right images. Thus, we can 
also specify the pre-recombinant form of 5 H J for each conformation of C fl J as shown in Figure [18] □ 

Observations 

Since B f] J contains at most one crossing, the component of D with almost all of the twists of C(2, s) 
must be contained in C. In form (72, while there may be twists to the right of B, they are topologically 
insignificant, since they can be removed by rotating C D D hy some multiple of tt. In form CI, any twists 
which had occurred above B can be removed and added to the row of twists below B by rotating C f) D 
by some multiple of tt. These rotations can occur while pointwise fixing B. Thus the five forms of C fl J 
illustrated in Figure [TS] are the only ones possible. 

5. Characterization of knots and links arising as products of site-specific recombination 

on a twist knot 

In this section we prove Theorems 1 and 2 which determine all the putative DNA knot and link products of 
(non-distributive) site-specific recombination on a twist knot substrate. We show that these products belong 
to one of the three families of knots and links illustrated in Figure [2] (These families of knots and links are 
defined in section 2.3). We also identify all knots and links that arise as products of distributive site-specific 
recombination. 

Here we use our preliminary work from Section 12.31 to prove our main results. In this section, we sup- 
pose that the substrate is a twist knot C{2^v) and that all three of our assumptions hold for a particular 
recombinase-DNA complex. We prove Theorems 1 and 2 which characterize all possible knotted or linked 
products brought about by a non-distributive reaction with a tyrosine recombinase and a serine recombinase, 
respectively. Most knotted and linked products are in the family F(p, g, r, 5, t, u). However, there are a series 
of products of site-specific recombination with a tyrosine recombinase that instead belong to one of Gi{k) 
or G2{k) (see the proof of Theorem 1 and Figure [TTj). In this section we also discuss knots that cannot arise 
as products of different scenarios of site-specific recombination on twist knots. 

Theorem 1. Suppose that Assumptions 1, 2 and 3 hold for a particular tyrosine recombinase-DNA complex 
with substrate J. If J is a twist knot C{2^v) then the only possible products (of a non- distributive reaction) 
are the unknot, the Hopf link, C{r^ s) for r = {1, 2, 3, 4}, T(2, m), a connected sum T(2, 7Ti)tlC(2, s), a member 
of the family F{p^ g, r, 5, t, u) in Figure [2a\ with \r\ > 2, \t\ = 1 or 2,\p\ < 1, or a member of the family of knot 
and links Gl or of the family of knots and links G2. 

The possible products are illustrated in Figure [171 

Proof By Assumption 3, after recombination with a tyrosine recombinase B D J has one of the eight post- 
recombinant forms illustrated in Figure [16] By Lemma 1, C fl J has one of the five forms illustrated in 
Figure [18] The products of recombination mediated by a tyrosine recombinase are obtained by replacing 
the pre-recombinant forms of 5 fl J in each of the forms of C fl J (in Figure [T8|) with each of the eight 
post-recombinant forms of B f] J (in Figure [T6|) . The resulting products are illustrated in Figure [TTl 

More specifically, suppose that J is C{2,v). Then by Lemma 1, C fl J can have form d,C2,C3,C4 or 
(75. Hence by Figure [TT] the possible products are the unknot, C(r, s) for r = {1,2,3,4}, T(2,7ti), a Hopf 
link, a connected sum T(2,7Ti)tlC(2, s), a member of the family F(p, g, r, 5, t, ix) in Figure [2a] with \t\ = 1 or 
2,|p| < 1 and a knot or a link that has a projection in either Gl or G2. 



Note that from Figure [TTl we can see that ah the possible products of site-specific recombination medi- 
ated by a tyrosine recombinase on a twist knot substrate belong to one of the subfamilies of F{p^ g, r, s, t, u) 
as illustrated in Figure [9l with one exception. For the image on column 7, row 5 of Figure [T71 depending on 
the value of v^ we get different knots or links: 
If V is an odd number, then the product is a knot: 

If V is a negative odd number, the product is a knot belonging to family Gl with k = \v\. 
If 'U is a positive odd number, the product is a knot belonging to family G2 with k = \v\ — 1. 
If V is an even number, the product is a two component link: 

If V is a negative even number, the product is a link belonging to family Gl with c = \v\. 
If V is a positive even number, the product is a link belonging to family G2 with c = \v\ — 1. 

Thus, with the exception of one sequence of products, all products of recombination with a tyrosine re- 
combinase belong to the family of small Montesinos knots and links illustrated in Figure [2al D 

It follows from Theorem 1 that every product of recombination with tyrosine recombinases is a member of 
the family in Figure [2al with |t| = 1,2 and \p\ < 1, or a member of the families Gl and G2. Also, it follows 
from Figure [9] that C{2^s) (possibly with an additional trivial component) T(2,m) and T(2, m)tiC(2, 5) can 
be obtained as knots or links in F(p, g, r, 5, t, u) with \t\ = 1,2 and \p\ < 1. 

Theorem 2. Suppose that Assumptions 1, 2 and 3 hold for a particular serine recomhinase-DNA complex 
with substrate J. If J is C(2, v) then the only possible products (of a non- distributive reaction) are the C{r^ s), 
T(2, m), a connected sum T(2, m)^C{2^ s) and any member of the family in Figure [2o\ with \r\ > 2, t 7^ and 

\p\ < 1. 

The possible products are illustrated in Figure [T5l 

Proof By Assumption 3, after recombination with a serine recombinase, B f] J has one of the two post- 
recombinant forms nl and n2 illustrated in Figure [51 Also, by Lemma 1, C fl J has one of the five forms 
illustrated in Figure [181 For Assumption 3 for serine recombinases, for each of the forms of C D J, the 
products of recombination with serine recombinases are obtained by replacing each of the pre-recombinant 
forms of B nJ with their corresponding post-recombinant form of B DJ after n rounds of processive recom- 
bination according to Figure [5l The resulting products are illustrated in Figure [T5l 

More specifically, suppose that J is C(2, v). Then according to Lemma 1, CD J can have forms CI, C2, C3, C4 
or C5. When C H J has form (71, then B D J must have form Bl. It follows from Figure [51 that the post- 
recombinant form of B n J must be of form n2. Thus, by replacing B D J with Bl in CI, we obtain that 
the products can be any knot or link in subfamily 3 illustrated in Figure [9l When C D J has form C2, then 
BnJ must have form 52, B3 or B4. In this case by Figure [51 the post-recombinant form of Bn J must be of 
form nl or n2. We see from form C2 in Figure [TSl that the products can be any knots or links in subfamily 
5 or subfamily 7 illustrated in Figure [9l A similar analysis is made on the other possible forms of C fl J to 
arrive to the conclusion that the products can be any knots or links in subfamilies 1, 3, 5, 7 or 8 illustrated 
in Figure [9l and thus, are members of the family F(p, g, r, 5, t, u) (See Figure [T5j). D 

Table 1 summarizes the results of Theorems 1 and 2. 

Note: Theorems 1 and 2 distinguish between the chirality of the product DNA molecules, since using 
our model we can work out the exact conformation of all possible products of site-specific recombination 
starting with a particular twist knot susbtrate and site-specific recombinase. For example, starting with 
the twist knot substrate C(2,— 1) (a right-handed (or (-h)) trefoil), according to our model, site-specific 
recombination mediated by a tyrosine recombinase yields T(2, 5), which is a (+) 5i (among other products) 
and can never yield T(2,-5), which is a (-) 5i. For an explicit strategy see our paper [39]. 

5.1. Knots and links that cannot arise as products. There are a number of simple knots and links 
that cannot arise as products of non-distributive site-specific recombination. 
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Corollary 3. Suppose that Assumptions 1, 2 and 3 hold for a particular site-specific recombinase-DNA 
complex with substrate a twist knot C(2, v). Any product arising that falls outside of families F{p^ g, r, s, t, u), 
Gl or G2 must arise from distributive recombination. 

For example, the knot 8i8 is a knot that is not Montesinos, thus it does not belong to our family of small 
Montesinos knots and links. It also does not belong to either Gl or G2, so 8i8 is an example of a knot that 
cannot arise as a product of non-distributive recombination on a twist knot substrate. 

Knots and links in F{p^ g, r, 5, t, u) that cannot arise from recombination mediated neither by a serine recom- 
binase, nor a tyrosine recombinase. The knot IO141 cannot be expressed in F(p, g, r, s, t, u) with t ^ and 
\p\ < 1. Recall that all products from recombination with a tyrosine recombinase or a serine recombinase be- 
longing to F(p, g, r, s, t) can be expressed with r > 2, t 7^ and |p| < 1. Thus IO141 cannot arise as a product. 

Knots that cannot arise from recombination mediated by a tyrosine recombinase. There are knots and links 
in F{p^q^r^s^t^u) which do not have a projection with \t\ = {±1,±2} and \p\ < 1, for example, the knot 
811 = F(2, 2, 2, — 1, — 3, 0). By inspection we can see that there is no way to express 811 as a member of 
F(p, q^ r, s, t, u) with t = {±1, ±2} and \p\ < 1, hence 811 is not a product of recombination with a tyrosine 
recombinase. The knot 1064 is another example of this. 

Knots that can arise as products of recombination mediated by a serine recombinase, but not by a tyrosine 
recombinase. In contrast with Theorem 1, any knot or link in the family illustrated in Figure [2al with t j^ 
(not just t = {±1,±2}) and \p\ < 1, can occur as a consequence of Theorem 2. The knot 811 mentioned 
above is an example of this; this knot is a possible product of recombination with a serine recombinase, but 
not with a tyrosine recombinase. 

6. Minimal crossing number of our model 

In this section we prove Theorem 4 which shows that all the possible DNA knot and link products of site- 
specific recombination on a twist knot substrate are a very small fraction of all knots and links. We also 
further restrict the knot and link types of products that have minimal crossing number one more than that 
of the substrate. 

6.1. The growth of product knots and links is proportional to n^. To prove the main theorem of this 
section. Theorem 4, we will split the family F(p, g, r, s, t, u) of knots and links into seven smaller subfamilies 
illustrated in Figure [25l Theorem 4 is independent of how family F(p, g, r, s, t, ii) is split up since we are 
using these subfamilies to count all the possible knots and links belonging to F{p^ g, r, 5, t, u). 

Definition. For a knot or link K the minimal crossing number MCN(i^) is the smallest number of crossings 
over all possible projections. For a knot or link K, denote its minimal crossing number by MCN(i^). 

The number of prime knots and links (links with up to two components and counting chiral pairs separately) 
with minimal crossing number n grows exponentially as a function of n [46]. By contrast, we now prove 
that the total number of knots and links with MCN(i^) = n that are putative products of site-specific 
recombination on a twist knot substrate grows linearly as a function of n^. Our families include prime 
and composite knots and links with up to three components. For the purposes of this section, we do not 
distinguish handedness of chiral knots, however, even including both versions of chiral knots, still our family 
grows slower than the function of n^ multiplied by 2. This actually means that all the possible prime knot 
and (two-component) link products of site-specific recombination on a twist knot substrate are a very small 
fraction of all knots and links. 

First, we consider knots and links belonging to F(p, g, r, t, s, ii). Note that, while the knots and links in 
this family have at most six non-adjacent rows containing p^q^r^s^t and u signed crossings respectively, it 
does not follow that the minimal crossing number of such a knot or link is |p| + |<7| + |r| + |s| + |t| + |ix|. If the 
knot or link is not alternating, it is quite possible that the number of crossings can be significantly reduced. 
Thus, a priori^ there is no reason to believe that the number of knots and links in this product family should 
grow linearly with n^. 
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Definition. A link diagram is called reduced if it does not contain any 'removable' or 'nugatory' crossings. 
A reduced alternating link diagram is a link diagram that is reduced and also alternating. 

Murasugi [47] and Thistlethwaite [48] proved that any reduced alternating diagram has a minimal number 
of crossings. Buck and Flapan [37] used this to show that for a twist knot C(r, s) if r and s have the same 
sign, then MCN(C(r, s)) = \r\ + \s\ — 1, and if r and s have opposite sign then MCN(C(r, s)) = \r\ -\- \s\. 

To prove our result, we consider a Hara-Yamamoto projection: a projection of a knot or a link in which 
there is a row of at least two crossings and which has the property that if this row is cut off from the rest of 
the projection and the endpoints are resealed in the two natural ways, then both resulting projections are 
reduced alternating (see Figure [26]). Hara and Yamamoto showed that any Hara-Yamamoto projection has 
a minimum number of crossings [49] . 

We make use of the following theorem proved by Lickorish and Thistlethwaite, in [50] : 

Theorem. (Lickorish- Thistlethwaite) If a link L admits an n-crossings projection of the form as in 
Figure\3^a) with k = and each Ri a reduced alternating rational tangle diagram with one crossing between 
the two arcs at the bottom of each Ri and at least one more crossing. Then L cannot be projected with fewer 
than n crossings. 

We refer to such a projection as a reduced Montesinos diagram. We can deduce from the theorem that any 
projection of a knot or link that is a reduced Montesinos diagram has a minimal number of crossings. 

We begin with two lemmas. 

Lemma 2. The number of distinct knots and links in the product family illustrated in Figure\2^ with MCN= 
n grows linearly with n^ . 

Proof. Fix n and suppose i^ is a knot or a link projection in the family of Figure [2al with minimal crossing 
number n. Then this projection has \p\ + \q\ + \r\ + \s\ + \t\ + \u\ crossings. We divide the proof into three 
cases: 

Case 1. K reduced alternating or reduced Montesinos: If projection K is reduced alternating or a reduced 
Montesinos diagram, then \p\ + l^l + \r\ + \s\ + |t| + \u\ = n. 

We now show that if K is not reduced alternating or a reduced Montesinos diagram then it is ambient 
isotopic to one of 121 possible projections which have minimal number of crossings. 

Case 2. K can be isotoped to a reduced alternating or reduced Montesinos diagram: Figure [271 illustrates 
an example of how to reduce the number of crossings in a projection K that is not reduced alternating 
or reduced Montesinos. Observe that for the link in Figure [271 the part containing the rows of r and q 
crossings is alternating if and only if r and q have opposite signs. Similarly for the section containing the 
rows of r and u crossings. If r and q have the same sign, then by moving a single strand, this part of the 
knot or link becomes alternating. This isotopy removes a crossing from both the r row and the q row and 
adds a single new crossing. Thus we reduce this part of the diagram from having \r\ + \q\ crossings in a 
no n- alternating form to having (|r| — 1) + (\q\ — 1) + 1 crossings in an alternating form. Similarly, for the 
middle and left hand side of the diagram, a non- alternating diagram having \r\ + \u\ crossings is reduced to 
an alternating for having (|r| — 1) + (\u\ — 1) + 1 crossings. So overall, our original non- alternating diagram 
having \r\ + \q\ + |^i| crossings is reduced by an isotopy of two strand movements to a reduced alternating 
diagram having (|r| — l) + (|g| — l) + (|^i| — l) + 2 = n crossings. Note that we can also change non- alternating 
diagrams to reduced Montesinos diagrams using strand movements like these. 

Case 3. K cannot be isotoped to either a reduced alternating or reduced Montesinos diagram: There are also 
cases where we cannot obtain a reduced alternating or reduced Montesinos diagram via strand movements 
of K. We describe a specific example illustrated in Figure [26l Let i^ be a knot or link diagram in our 
family F(p, g, r, 5, t, ii) with t,p = l,r > 1,5 = l^q^u < —1. In its original form, the projection has 
(r — 1) + {\u\ + 1) + (l^l + 1) crossings. The projection on the left of Figure [26] is Hara-Yamamoto because 
the projections (on the right) obtained by resealing the endpoints are both reduced alternating. Thus, this 
projection has a minimum number of crossings. 
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We consider 121 cases according to the values oip^q^r^s^t and u^ and show that in ah but the Hara-Yamamoto 
case K^ the initial diagram is isotopic to a diagram that is either reduced alternating or reduced Montesinos 
and hence has minimal crossing number. Since there are so many cases, we display the results in Tables 2 to 
5 rather than discussing each case individually. We make the following notes and observations with respect 
to the Tables. 

• To compute Tables 2, 3, 4 and 5, the family of knots and links F(p, q^ r, s, t, u) is broken down into 
seven smaller subfamilies, shown in Figure [25l We count knots and links belonging to subfamilies de- 
notedby F5'^(0,g,r, s,t,ii) with \r\ > 0, \t\ > 1, ^5-2(^1,^,^,5,^1,1^) with \r\ > 1, ^5-3(^1, g, r, 5, t, i^) 
with |r|, \t\ > 1, Fs^{p^q^r^s^t^u) with |t|, |r|, \p\ > 1, T(2,r), K{q^s^u) and the unlink (subfamilies 
illustrated with a double arrow in between indicate that they give the same knots and links, so we 
count only one of them). Observe that in subfamily ^5-2 (±1, ^, r, s, ±1, u)^ the rows of crossings con- 
taining u and q crossings are interchangeable, so we treat the variables u and q as interchangeable. 
Similarly, in subfamily ^5-3(^1, g, r, s, t, i^), the tangles Ri and R2 are interchangeable, so we treat 
the variables r and t as interchangeable and s and u as interchangeable. A similar consideration is 
given to subfamily Fs^ (0, g, r, 5, t, u) and Fs^ (p, <7, r, s, t, u). For certain specific values of p, g, r, s, t 
and ii, we may obtain a trivial knot or link. However, we do not specifically exclude these cases from 
our Tables. 

• For all tables: column two lists the form of the knot or link which has a minimal number of crossings 
(e.g. reduced alternating). If the knot or link is isotopic to a clasp, pretzel, or torus knot or link or 
a composition of any of these, we list the specific form. Also, if one of the knots or links contains 
a trivial component, we use the shorthand +0 to indicate this. Column three shows the number 
of strand movements needed to achieve a diagram with minimal number of crossings. We write an 
expression with (±tt?) ^^ ^^e end to indicate that there may be tl more or less number of strand 
movements, depending on the values of the relevant variables. Column four shows the MCN of the 
corresponding reduced alternating or reduced Montesinos conformation. The MCN is listed as an 
unsimplified function of p, g, r, s, t and u to help the reader recreate the isotopy taking the original 
form to the minimal crossing form. As a consequence, in column four we write an expression with 
(±tl?) at the end to indicate that the MCN may be tl smaller or bigger. For example, when the 
minimal crossing form of the knot is a clasp knot C(r, s), if we do not know the signs of r and s, on 
column two, we write an expression with +1? and in column three we write an expression with —1?, 
see Figure [271 In column five we obtain the upper bounds for the number of links in each case by 
expressing MCN= n as a sum of nonnegative integers. This enables us to find an upper bound for 
the number of knots and links with MCN= n in each case. Note that the upper bounds given are 
intended to be simple rather than as small as possible. In particular, a number of our cases overlap, 
and thus some knots and links are counted more than once. 

• For all tables: We consider a knot or link and its mirror image to be of the same link type, and 
hence we do not count both. Thus without loss of generality, we assume that r > 0. 

There are 118 nontrivial cases in Tables 2, 3, 4 and 5. Any knots and links appearing more than once in the 
tables are counted only once. Thus there are at most 111 distinct families of knots and links listed in the 
tables. The number of knots and link in each of these families is bounded above by 4n^ (in fact, for most 
of the cases there are significantly fewer than 4n^ knot and link types). It follows that for a given n, the 
number of distinct knots and links in the product family F(p, g, r, s, t, ii) which have MCN= n is bounded 
above by 4n^ x 111 = 444n^. In particular, the number of distinct knots and links with the form of Figure 
[2al which have MCN= n grows linearly with n^ . D 

We now consider product knots and links belonging to Gl or G2. These come about as products of recom- 
bination with a tyrosine recombinase on a recombinase-DNA complex with conformation (74 illustrated in 
Figure [TSl and the post-recombinant conformation B = B6. 

Lemma 3. For a fixed n there exists at most one knot type in Gl with MCN equal to n. Similarly, for G2. 

Proof. Gl is reduced alternating, and hence has minimal number of crossings. Thus, it is clear that the 
MCN(Gl) = 4 + |'u| = n. Similarly, G2 is reduced alternating and thus has minimal crossing number, so 

MCN(G2) = 3 + V = n. D 
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We are now ready to prove Theorem 4. 

Theorem 4. The number of putative knots and links resulting from site-specific recombination on a substrate 
that is the twist knot C{2,v) with MCN equal to n grows linearly with n^ . 

Proof. There are at most 111 + 2 = 113 non-trivial, distinct famihes of knots and Hnks that are putative 
products of site-specific recombination on a substrate that is C(2,'u); 111 belong to the family of small 
Montesinos knots and links illustrated in Figure [2al (the number of such knots and links is bounded above by 
4n^) and two belong to the families Gl and G2 illustrated in Figures [2bl and [2cl It follows that for a given 
n, after recombination on a twist substrate, the number of distinct knots and links which have MCN= n is 
bounded above by 4n^ x 113 = 452n^. In particular, the number of distinct knots and links that belong to 
the families Gl, G2 and/or F(p, g, r, 5, t, u) that have MCN= n grows linearly with n^ . D 

It follows from Theorem |4] that the proportion of all knots and (two-component) links which are contained in 
the families F(p, q^ r, s, t, u)^ Gl and G2 decreases exponentially as n increases. Thus, for a knotted or linked 
product, knowing its MCN and that it belongs to one of these families allows us to significantly narrow the 
possibilities for its precise knot or link type. The model described herein thus provides an important step in 
characterizing DNA knots and links which arise as products of site-specific recombination. 

6.2. Products whose MCN is one more than the substrate. We now prove a more directly applicable 
theorem. Site-specific recombination often increases the MCN of a knotted or linked substrate by one, see 
for example Table 1 in [38]. If the substrate is C(2, v), with minimal crossing number m and the product of 
a single recombination event has MCN= ?n + 1, then we can further restrict the resulting knot or link type. 
Recah the the MCN(C(2, v)) = 2 + \v\ for v < and MCN(C(2, v)) = I ^ v ioi v > 0. 

We remark that site-specific recombination that increases the minimal crossing number of the product by 
one could result in a change in the number of components. For example, if the substrate is C(2, 2) which 
is a one component link (a knot) one of the possible products according to Theorem 5 is T(2,4), a two 
component link. 

Theorem 5. Suppose that Assumptions 1, 2, and 3 hold for a particular recombinase-DNA complex with 
substrate J = C(2, v)^v ^ and denote the MCN{J) = n > 0. Let L be the product of a single recombination 
event and suppose MCN{L) = n-\-l. Then: 

Ifv >0,Lis either: C(2, v+1), C(2, -v), C(-2, v), C(-2, -(1+^)), C(3, v), T(2, ±(2+^)), Fs, (0, q, 2, s, 2, u) 
where u -\- s = v, Fs^ (il, il, 2, 5, ±1, u) where u -\- s = v or s ^ or Fs^ (±1, 0, 2, 5, 2, u) where u -\- s = v 
and s^u ^ 0. 

If V < L is either: C(2, 2 + |v|), C(2, -(1 + |v|)), C(-2, 1 + |v|), C(-2, -(2 + |v|)), C(3, v), C(-4, v), 
T(2, ±(3 + I^D) or Fs^ (±1, ±1, 2, s, ±1, u) for u ^ s = v. 

Table [6] summarises this information. 

Proof Firstly, note that n > 2. Note also that if v = 1 then n = 2, but there are no nontrivial knots 
with minimal crossing number equal to 2, so the substrate must be the unknot, which is considered in J37] 
and [38]. We exclude the case when v = 1. 

For n = 3, C(2, v) is the trefoil knot 3i (i.e., v = —1) so L must be the Figure of eight knot 4i = C(2, —2) or 
the torus link T(2, ±4), since these are the only knots and links with minimal crossing number equal to 4. 

Now assume that n > 4, that is v < — 2 or v > 3. By Assumption 1, there is a projection of J such 
that B n J has at most one crossing. Since J = C{2,v), the proof of Lemma 1 shows that C D J has the 
forms CI, C2, C3, C4 or Cb (Figure [18]). When C D J has form CI, then u-^ s = v. By Assumption 3 and 
Figures [5] and [161 the post-recombinant form of 5 fl J is one of those illustrated in Figure [161 Thus any 
knotted or linked product L has one of the forms illustrated in Figure [iTl 
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Now, suppose that L has one of the forms ihustrated when C fl J has form C2, C3, C4 or Cb. L can- 
not be either T(2, 2)ttC(2,i;) or T(2, -2)ttC(2, v) because MCN(L) = n + 2. L can certainly not be a Hopf 
Hnk, an unknot or C{2,v) with a trivial component, since MCN(L) ^ n -\- 1. Finally, L cannot be C(4, v) 
because if v > 0, MCN(L) = 3 + v and of v < 0, MCN(L) = 4 + |v|. 

If L = (7(2, n) then n = 1 + '^ or — 'u when 'i; > or n = 2 + |'y| or '^ — 1 when 'i; < 0. If L = C(k^v) 
then A; = 3 Vv, A; = -2 for V > and A; = -4 for v < 0. If L = T(2, n) then for v > n = ±{v + 2) and for 

V <{) n = ±(3+|v|). 

If L = ^5^(0, g, 2, s, t, ii) for s + ii = V for some value of t then L has a projection in this product sub- 
family with t = ±2. So we can assume L has a projection of the form Fs^ (0, g, 2, s, ±2, i^) with i^ + 5 = 'i;. 
If t = -2, for V < MCN(L) = 1 + |^x| + |s| + 1 = 3 + |v| = n + 1 so this case is possible and for 

V > MCN(L) = |-2|+^/ + l + s = 3 + V7^n + lso this case in not possible. If t = +2, for 

V < MCN(L) = 2 + |ii| + 2 + 1^1 =4+1^1 7^ n + 1 so this case is not possible and for t; > 
MCN(L) = 1 -\- u -\- 1 -\- s = 2 -\- V = n + 1 so this case is possible. So L = ^^^(O, g^, 2, 5, — 2,ix) only 
for V < and L = F(2, 5, 2, u) for v > 0. 

Now suppose that L has one of the forms illustrated when C H J has form CI. Suppose L has a pro- 
jection of the form ^5-2 (±1, ^, 2, s, ±1, u) with ii + 5 = v. For v > and s = 0, g must be 0, however, this 
is isotopic to T(2,ii), which has MCN= v^ thus this is not allowed. For v > and 5 7^ 0, g = ±1 and for 

V < 0^ q = ±1. That is, for 'z; > 0, L = ^5-2(^1, ±1, 2, s, ±1, ix) for s 7^ and s -\- u = v^ and for 'i; < 

L = Fs^ (±1, ±1, 2, s, ±1, u) for u^s = v. 

If 1/ has a projection of the form ^5-3(^1, g, 2, 5, t, ix) for some value oft, then L has a projection in this product 
subfamily with t = ±2. Thus we now assume that L has a projection of the form ^5-3(^1, g, 2, s, ±2, i^) with 
u^s = v. IH = 2J0TV < 0, MCN(L) = 2 + |^/|+2 + |s| + |g| =4+|v| + |g| > n+1 for any value of g, so this case 
is not possible, for 'u > MCN(L) = l+ii+l + s+l^l = 2+v+|(7|, so this case is possible for q = 0. In this par- 
ticular case, if one of ii or 5 equals 0, then MCN(I/) = 3+'z;+|(7| > n+1 for any value of g, so L = F(g, 2, 5, 2, u) 
only for V > smds.u^ 0. Ift = -2, for 'u > MCN(L) = | - 2| +^/ + l + s+ |g| = 3 + v+ |g| > n + 1 for any 
value of g, so this case is not allowed and for v < MCN(L) = | — 1| + |ii| + 2 + |5| + |g| = 3 + |v| + \q\ so this 
case is possible for q = 0. In this particular case, if one of ii or s equals 0, then MCN(L) = 4+|v| + |(7| > n + 1 
for any value of g, so L = F(g, 2, 5, —2, u) only for v < and s^u ^ 0. In summary, L = ^5-3(^1, 0, 2, s, —2, u) 
for 1; < and L = F(0, 2, s, 2, ix) for v > 0, both with ix, s 7^ are the only possibilities allowed. 

Finally, suppose L has a projection of the form Gl. Then v < and MCN(L) = 4+|v| > 3+|v| = 
n + 1, thus L cannot have this conformation. Suppose has a projection of the form G2, then v > and 
MCN(L) = 3 + '^; > 2-\-v = m-\-l and so L cannot have this projection either. This completes the proof. D 

7. Applications of our model 

We discuss how the model developed here can be a useful tool to analyse previously uncharacterized data in 
a variety of setttings in [39] . 

These applications fall into four broad categories: Application 1: our model can help determine the or- 
der of products of processive recombination. Application 2: in the common situations where the products 
of site-specific recombination have MCN one more than the MCN of the substrate, our model can help 
reduce the number of possibilities for these products. Application 3: our model can help predict products of 
processive and distributive recombination. Application 4-' our model can help distinguish between products 
of processive and distributive recombination. 

To give a fiavour of how to use this model, we conclude with a simple example of Applications 1 and 
2. 

Application 1. Our model can be used to help understand processive recombination mediated by a serine 
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recombinase. Using Figure [T5l and Table 1 which summarize the conclusions of Theorem 2, we can narrow 
the possibilities for the sequence of products in multiple rounds of processive recombination. Suppose that 
for a twist knot substrate of the form C{—2^v) with v ^ 0^ experimental conditions minimize distributive 
recombination and the products of multiple rounds of processive recombination are twist knots, unknots (or 
(7(— 2, s) + O) and the connected sum of a torus knot and a twist knot C(— 2, s)tlT(2, m). Then from Figure 
[T5lfa) we can determine that recombination happens from the twist knot substrate to the clap knot with a 
trivial component C(— 2, v) -\- O, product of the first round of recombination, to the connected sum of torus 
knots and clasp knots, product of the second round of recombination. Moreover, any products of further 
rounds of recombinations are connected sums of the form C(— 2, 'u)jlT(2, m) with increasing minimal crossing 
number. 

Application 2. We now demonstrate an application of Theorem 5. Suppose the twist knots C(2, 5) and 
C(2, 7) (which have MCN equal to 6 and 8 respectively) are used as substrates for a site-specific recombin- 
ation reaction with a tyrosine recombinase, where experimental conditions eliminate distributive recombin- 
ation and products are knots and links with minimal crossing number 7 and 9. In this case the minimal 
crossing number is not sufficient to determine the knot type, since there are 7 knots, 8 two-component links 
and 1 three-component link with MCN=7 and 49 knots, 61 two-component links and 22 three-component 
links with MCN=9. However, we can use Theorem 6 and Table 7 to significantly reduce the number of 
possibilities for these products. It follow from Theorem 6 that the possible seven-crossing products are 7i, 
^2, ^3, 76, 72, , 73, or 3itl4i; and the possible nine-crossing products are 9i, 92, 93, 98, 9ii, 9^, , 9^0, SiflSi, 
or 4ijl52. In Table [71 we show how to do this. We have reduced from 16 choices for 7-noded knots to just 
7, from 132 possibilities for 9-noded knots and links to just nine possibilities. Thus, Theorem 6 can help to 
significantly reduce the knot and link type of products of site-specific recombination that add one crossing 
to the substrate. 
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Recombinase type Substrate Product 



Tyrosine C{2,v) unknot, C{r,s) for r = 1,2,3,4, T(2,m), Hopf link, 

T(2, m)ttC(2, s), F(p, g, r, 5, t, ix) with \t\ = 1 or 2,|p| < 1, knots or 
links in families Gl or G2 

Serine C(2, v) C(r, 5), T(2, m), T(2, m)ttC(2, 5), F(j9, g, r, s, t, ii) with |p| < 1 and 
t^O 

Table 1 . Products of non-distributive recombination predicted by our model. 



Values of p, q,r, s,t,u for r > Minimal crossing form Strands moved MCN as a sum of non-negative integers Upper bound on 

number of links 

p = 0,t,r > 2,u,s < —1 Reduced Montesinos 

p = 0,t,r > 2,u,s > 1 Reduced alternating 

p = 0,t < —2, r > 2,u,s > 1 Reduced alternating 

p = 0,s> l,u < —1, wlog \u\ < \s\ Reduced alternating 

p = 0^s = -u T(2,t + r) 

p = 0^s,u = r(2,t + r) 

p = 0^r = l,t = ±l T(2, (w ± 1) + (s + 1)) 

p = 0^r = l C(t,w + 5 + l) 

p = 0,u = 0,t,r >2,s > 1 Reduced alternating 

p = 0,u = 0,t < —2, r > 2, 5 < — 1 Reduced alternating 

p = 0,u = 0,t < —2, r > 2, s > 1 Reduced alternating 

p = 0,u = 0,t,r>2,s< —1 Reduced alternating 

p = 0,t = T(2,r) \t\ T 

p,r,t = unlink 






t+|w|+r+|s| 


n^ 


2 


(t - 1) + (w - 1) + (r - 1) + (s - 


- 1) + 2 n3 


1 


(r-l) + (s-l) + |t|+w+l 


2n3 


\u\ +1? 


|t|+r+(s- |w|) -1? 


4n2 


1^1 


\t + r\ 


1 





\t + r\ 


1 





|(u±l) + (s + l)| 


1 


1? 


|f| + |u + 5 + l| -1? 


4n 


2 


(t - 1) + (r - 1) + (s - 2) + 2 


n2 


1 


(|t|-l)+r + (|5|-l) + l 


n2 


1 


l^l + (^-l) + (^-l) + l 


n2 





t + r+|s| 


n2 



j9,r,t = ±l K{u±l,s±l,q±l) (|n ± 1|) + (|s ± 1|) + (|g ± 1|) 8n 

Table 2. Theorem 4: The minimal crossing forms of knots and links in subfamilies 
^^i (O7 ^7 ^7 ^5, t, ii), 5, 6 and 7 illustrated in Figure [25l 



2 
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Values of p, q, r, s,t,u ior r > 


Minimal crossing form 


Strands moved 


MCN as a sum of non-negative integers 


Upper bound on 
number of links 


p,t = ±1, u,q = 

p,t = ±l,r = 1 

p,t = ±l,r = l,s = 1 

p,t = ±l,r > l,g = 

p,t = ±l,r > l,uq = —1 

p,t = ±l,r>l,uq=l,s = 

p,t = ±l,r > l,u > l,q = l,s > 



p,t = ±l,r>l,u = q = l,s<0 

p,t = ±l,r > l,u < -l,q = 

-l,s > 

p,t = ±l,r>l,u,q<0,s<0 

p,t = ±l,r>l,u,q>l,s = 

p,t = ±l,r > l,u < -l,q > 

l,s = 

p,t = ±l,r > 1, \u\ > 1, \q\ = 

l,s = 

p,t = ±l,r > l,qs = -1 

p,t = ±l,r > l,u> 0,q = l,s < 



p,t = ±l,r > l,w < -2,g = 

l,s < -2 

7?, t = ±1, r > 1, w, g > 0, s = 1 

p,f = ±l,r > l,u < -l,q > 

0,s = 1 

p,f = ±l,r > l,w < -l,g = 

l,s > 1 

p,f = ±l,r > l,u > l,g = 

-l,s < 

p,t = ±l,r > l,ii > l,g = 

-1,5 = 2 

p,f = ±l,r > l,p > l,g = 

-l,s > 2 

p,t = ±l,r> 1, |w|,|g| > l,s < 

p,t = ±l,r > 1, \u\,\q\ > l,s > 1 

p,t = ±l,r > l,w < -l,g = 

-2,s = 1 

p,t = ±l,r > l,u,q < -2,s = 1 


C(r, s)+0 

K{u,s + l,q) 

T(2,u)tJT(2,g) 

T(2,w)tJC(r,5) 

r(2,r) 

C(±2,r) 

Reduced alternating 


1? 





1? 





1 


|r| + |5| -1? 

w| + |s + l| + \q\ 

H + \q\ 

\u\+r+\s\ -1? 

r 

2 + r -1? 

w + (r - 1) + (5 - 1) + 2 n2 


4n 

4n2 

2n 

4n2 

1 

2 


Reduced alternating 
Reduced alternating 

Reduced alternating 
Reduced alternating 
Reduced alternating 


1 
2 


2 

1 


r + (-s- l) + 2 

-w + (r - 1) + (s - 2) + 2 n2 

—u — q -\- r — s 

{u - 1) + (g - 1) + (r - 2) + 2 n^ 

-w + (g-l) + (r-l) + l 


n2 


C{r±l,u) 





-u + (r±l) -1? 


4n 


T(2,r + u±l) 
Reduced alternating 


1 
1 


\r + u±l\ 

w + r + (-s -1) + 1 


1 
n2 


Reduced alternating 

Reduced alternating 
Reduced alternating 


1 

1 
1 


(-w-2)+r+(-s-2) + l 

w + g + (r - 1) + 1 
{-u-l)+q+{r-l) 


n2 

2 

n 
n2 


Reduced alternating 
Reduced alternating 
Trivial 


2 
1 
2 


(-w-l) + (r-l) + (5-l) + 2 

(u - 1) + r - 5 + 1 

0/n 


n2 



Reduced alternating 

Reduced Montesinos 
Reduced Montesinos 
K{u,r- 1,2) 


3 



1 
1 


(w - 2) + (r - 1) + (5 - 3) + 2 

\u\ -\- \q\ -\- r - s 

|u| + |g| + (r-l) + (s-l) + l 

-w + 2 + (r - 1) 


n2 

4n3 
4n3 
n 


HaraYamamoto 


1 


-u + (-q- 1) + (r- 1) 


n2 



Table 3. Proof of Theorem 4: The minimal crossing forms of knots and hnks in subfamily 
^5-2 (±1, ^, ^, s, ±1, 1^) illustrated in Figure [25l 
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Values of p, q,r, s,t,u for r > 



Minimal crossing form Strands moved MCN as a sum of non-negative in- 
tegers 



Upper bound on 
number of links 

~^ 



2n 



1,-1 < u,s < Reduced alternating 



p=l,t,r>2,q = - 

0, not both u,s = 

p = l,t,r > 2,u,s,q < —1 Reduced 

p = l,t,r > 2,u,s,q = 1 Reduced 

p = l,t,r > 2,u = 0, s,q = 1 Reduced 

p = l,t,r > 2,u = 0, s,q > 1 Reduced 

p = l,t,r > 2,u,s,q > 1 Reduced 

p = 1, t,r > 2,-1 < u,s < 0,q = Reduced 

1 not both u,s = 

p=l,t,r>2,u,s<-2,q>2 

p = l,t < -2,r > 2,0 < w < 

1,-1 <s<0,q=-l 

p=l,t,s,q< -2,r,u>2 

p = l,r > 2,t < -2,-1 < u,s < 

0,q = l 

p=l,r>2,t<-2,u,s<-2,q> 

2 

p=l,t,r>2,s = 0u = 0,q=-l 

p = l,t,r > 2,u = 1, s,q = —1 

p = l,t,r,u > 2,q, s < —2 

p = l,r > 2,t < -2,u = 0,s = 

l,q = -l 

p = l,r > 2,t < -2,u,s = l,g = 

-1 

p = l,r,u,s>2,t,q<-2 

p=l,t,r>2,u = 0,q=l,s = -l 

p = l,t,r > 2,u,q = l,s = —1 

p = l,t,r,u,q >2, s < —1 

p = l,r > 2,t < -2,u = 0,0 < 

s<l,q=l 

p = l,r > 2,t < -2,0 < s < 

l,u,q = l 

p=l,r>2,t<-2,u,s,q<-l 

p=l,t,r>2,u = l,s = 0,q = -l 

p = l,t,r > 2,u, s = l,q = —1 

p = l,t,r,s,u > 2,q < —1 



Montesinos 
alternating 
alternating 
alternating 
alternating 
alternating 



Reduced Montesinos 
Reduced alternating 

Reduced Montesinos 
Reduced Montesinos 

Reduced Montesinos 

Reduced alternating 
Reduced alternating 
Reduced Montesinos 
Reduced alternating 

Reduced alternating 

Reduced Montesinos 
Reduced alternating 
Reduced alternating 
Reduced Montesinos 
Reduced alternating 

Reduced alternating 

Reduced Montesinos 
Reduced alternating 
Reduced alternating 
Reduced Montesinos 



p = l,r > 2, t < —2,0 < u < Reduced alternating 

l,s = -l,q = 1 

p = l,r,u > 2,t < —2,q > 1, s < Reduced Montesinos 

-1 

p = l^r = s = l T(2, g)tJC(t, u) 



p = l^q = 

p = l,q = 0,us = —1 
p = l,q, s = 
p = l,u,s = 
p = l,u, s,q = 



T(2,r)tJT(2,f) 
T{2,r)iC{t,u) 
C{t + r, q) 
T(2,t + r) 
unknot 



2 
1 
2 
2 

2 



1? 

2? 

2 

1? 
1? 


kl 



|r| + |.| + |t| + |u| + (|g|(±l)) 

|r| + |.| + |t| + |u| + (|g|(±l)) 
r - 1) + (f - 1) + 3 
r - 2) + (t - 3) + 1 
r-l) + (s-l)+t + l 
r-l) + (t-l) + (5-l) + (w-l) + 2 
- 1) + (t - 1) + 1 

r + t+|w| + |s| + g 
r+ \t\ +U+ \s\ 

r+\t\+u+\s\ + \q\ 
r+|t| + (|w|-l) + |s| + l 



(r - 1) + (t - 1) + 1 
(r - 1) + (t - 2) + 1 

r+{t-l) + {u-l) + \s\ + \q\ 
\t\+r 

\t\+r-l 

(r-l) + (s-l) + |t|+w+|g| + l 

t + r + 2 

(t - 1) + r + 2 

{t-l) + {u-l)+r + \s\+q + l 

|t|+r + 2 

(|t|-l) + r + 3 



2n^ 



2n^ 



r + (|t| - 1) + {\u\ -l) + \s\ + q + l 2n4 



2n 
2n 

2n^ 
2n 



2n 
2n 

2n4 
2n 

2n 



(|t|-l) + (|u|-l) + |8| + |g|+r+l 


2n4 


(t - 1) + r 


2n 


(t - 1) + (r - 1) + 1 


n 


(t - 1) + (r - 1) + (u - 1) + (5 - 


n^ 


i) + kl + i 




(t - 2) + (r - 1) + 1 


2n 


r + ii + g+|t| + |s| 


2n^ 


(|g| + l) + (|f| -l?) + (|u| - 


4n2 


1?) +1? 




(|r| -1?) + (|5| -l?) + (|t| - 


8n^ 


l?) + (|w| -1?) + 1 +2? 




l^l + kl + 1 


2n 


{\t\ -l?) + (|w| -l?) + |r| + l +1? 


An' 


(|t+r| -l?) + ((|g| + l)-l?) +1? 


An 


|t + r| + l 


1 









Table 4. Proof of Theorem 4: The minimal crossing forms of knots and hnks in subfamily 
^5-3 (±1, g, r, s, t, u) illustrated in Figure [25l 
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Values of p, q,r, s,t,u for r > 



Minimal crossing form Strands moved MCN as a sum of non-negative in- Upper bound on 

tegers number of links 



r,t,p > 2,-1 < 

0, no two oi u,s,q = 
r,t,p > 2, u,s,q < —1 
r,t,p > 2,0 < 

1, no two oi u,s,q = 
r,t,p >2,u,s,q >2 



u,s,q < Reduced alternating 



li, s, q 



r > 2,t,p < -2,w,g > l,s < -1 
r > 2,t,p < -2,0 < w,g < 
1, — 1 < s < no two of u,s,q = 
t,r > 2,p < -2,w, s < -l,g > 1 
t,r > 2,p< -2, -1 < w,s < 0,0 < 
g < 1, no two oi u,s,q = 
t,r>2,p<-2,u = 0,s,q>l 
t,r > 2,p < —2,u,s,q > 1 



Reduced Montesinos 
Reduced alternating 

Reduced Montesinos 

Reduced Montesinos 
Reduced alternating 

Reduced Montesinos 
Reduced alternating 

Reduced alternating 
Reduced alternating 



r > 2, t,p < —2, u,q < —1, < s < Reduced alternating 

1 

r >2,t,p < —2, u = 0,s,q < —1 Reduced alternating 

r >2,t,p < —2, 5,g>l,0<ii<l Reduced alternating 

r >2,t,p < —2, u,q > l,s = Reduced alternating 

r > 2, t,p < —2, q < 1,-1 < s,u < Reduced alternating 



p,r,t > 2,u,s > 0, g < Reduced Montesinos 

0, no two oi u, s,q = 

p,r,t > 2,u > 0, g, 5 < Reduced Montesinos 

0, no two oi u, s,q = 

p,r > 2,t < — 2,ii, s > 0, g < Reduced Montesinos 

0, no two oi u, s,q = 

p,r > 2,t < —2,u,q > 0, s < Reduced Montesinos 

0, no two oi u, s,q = 

t,r > 2,p < —2,u,s > 0, g < Reduced Montesinos 

0, no two oi u, s,q = 

t,r > 2,p < — 2, s > 0,ii, g < Reduced Montesinos 

0, no two oi u, s,q = 

t,r > 2,p < —2,u > 0, s,g < Reduced Montesinos 

0, no two oi u, s,q = 

r > 2,t,p < —2,u,s > 0, g < Reduced Montesinos 

0, no two oi u, s,q = 

p,r > 2,t < —2,s,u,q < Reduced Montesinos 

0, no two of u, s,q = 

r > 2,t,p < —2,u > 0,s,q < Reduced Montesinos 

0, no two oi u, s,q = 

r > 2,t,p < — 2, s > 0,u,q < Reduced Montesinos 

0, no two oi u, s,q = 

r > 2,t,p < —2,s,u,q < Reduced Montesinos 

0, no two oi u, s,q = 

r = l,s = -l C{t,u)iC{p,q) 



: l,s: 

1,5: 



-1,7. = ±1 



T(2,g)tiC(t,u) 
T(2,u)BT(2,g) 



1 
1 

1 

2 

1 

1 

1 

3 

2 

2 

2 

1 

1 

3 

2 

2? 

1? 





t+\u\+r+\s\+p+\q\ 

t+\u\ +r+ |s|+p+ \q\ 

(t-l) + (|u|-l) + (r-l) + (|.|. 

l) + (p-l) + (|g|-l) + 3 

(t-l) + (|u|-l) + (r-l) + (|.|. 

l) + (p-l) + (|g|-l) + 3 

\t\ + \p\ + \s\+r + u + q 

\t\ + \p\ + \s\+r + u + q 

\u\^\p\ + \s\+r + t + q 
\u\ + \p\ + \s\+r + t + q 



2n^ 

2n^ 
2n^ 

2n^ 

2n^ 
2n2 

4n2 



t + (r - 1) + (s - 1) + IpI + 3 + g An^ 

(t - 1) + (w - 1) + (r - 1) + (s - An^ 

l) + |p| + 2 + g 

(|t|-l) + (|u|-l) + (|p|-l) + (|g|- 2n^ 

l) + |5| + 2 + r 

|f| + |u| + (|p|-l) + (|g|-l) + |5|+2+r 2n5 

(r-l) + (5-l) + |p|+g+|t|+w + l 2n^ 

r+\p\ + q+\t\+u 2n^ 

(|p|-l) + (|g|-l) + |5|+r+|5|+f+l An^ 

(t - 1) + (w - 1) + (r - 1) + (s - 3n5 

l)+p+\q\+2 

(t-l) + (u-l) + |r| + |s|+7?+|g| + l 3n5 

|f|+ii+(r-l) + (5-l)+p+|g| + l 2n^ 

|f|+ii+(p-l) + (g-l) + r+|5| + l 2n^ 

(|t|-l) + (|u|-l) + (p-l) + (g- 2n^ 

l) + (r-l) + (s-l)+3 

f + w + {\p\ - 1) + {\q\ - 1) + (r - 2n^ 

l) + (s-l) + 2 

r + 5 + {\p\ - 1) + (|g| - 1) + (t - 2n5 

l) + (w-l)+2 

|f|+u+(|p|-l) + (|g|-l) + (r- 2n5 

l) + (5-l) + 2 

7^+kl + (l^|-l) + (k|-l)+^+|5| + l 2n5 

|f|+u+(|p|-l) + (|g|-l)+r+|5| + l 2n5 

(|t|-l) + (|u|-l) + (|p|-l) + (|g|- n^ 

l) + (r-l) + (s-l)+3 

(|t|-l) + (|n|-l) + (|p|-l) + (|g|- n^ 

l) + r + 5 + 2 

(|t| -l?) + (|u| -l?) + (|p| - 8n2 

l?) + (|g| -1?) +2? 

(|t| -l?) + (|u| -l?) + (|g|± 8n2 

1) (+1?) 

(|tx|±l) + (|g|±l) 2n 



Table 5. Proof of Theorem 4: The minimal crossing forms of knots and hnks in subfamily 
Fs^ (p^q^r^s^t^u) illustrated in Figure [25l 
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When 



L 



for 



u> 


C(2,n) 




n = 1 ^- V or —V 


v<0 






n = 2 + 'z; or —{\v\ + 1) 


w > 


C(-2,n) 




n = 'u or —(1 + v) 


w < 






n = 1 + |v| or -(|v| +2) 


Vw 


C(fc,t;) 




A: = 3 


t; > 






A: = -2 


v< 






fe = -4 


v>Q 


r(2,n) 




n = ±(2 + v) 


V <Q 






n = ±(3+|v|) 


v>Q 


Fs,(0,g,2,s,2,^ 


) 


U^ S = V 


v>Q 


Fs,(±l,(Z,2,s,±l 


u) 


ii + s = 'y, s 7^ 0, g = ±1 


V <Q 


Fs,(±l,(Z,2,s,±l 


u) 


ii + s = 'y,g' = ±1 


v>0 


Fs3(±l,g,2,s,2, 


u) 


ii + s = 'U,s,2i7^0,g = 



Table 6. Summary of Theorem 5. 



Products with 7 crossings 



C(2,6) = 72* 




C(2,-5) = 72* 




C(-2,5) = 72* 




C(-2,-6) = 72* 




C(3,5)=52 




T(2,±7) = 7i* 




Fs, (0,9,2,1,2, 4) = 


_ 72* 
- '3 


Fs, (0,9,2,2, 2, 3) = 


- '3 



Fs,(±l,l,2,l,±l,4) = 73* 
Fs,(±l,-l,2,l,±l,4) = 5i 
Fs,(±l,l,2,2,±l,3) = 7i* 
Fs2 (±1, -1, 2, 2, ±1, 3) =unlink 
Fs,(±l,l,2,3,±l,2) = 76* 
Fs^ (±1, -1, 2, 3, ±1, 2) =unknot 
Fs,(±l,l,2,4,±l,l) = 7i* 
Fs^{±l, -1, 2, 4, ±1, 1) =Hopf link 



Fs3(±l,0,2,l,2,4) = 52 
Fs3(±l,0,2,2,2,3) = 3itt4i' 



Products with 9 crossings 



C(2,8) = 92* 
C(2,-7) = 92* 
C(-2,7) = 92* 
C(-2,-8) = 92* 
C(3,7) = 9?* 
T(2,±9) = 9i* 



Fs, 


:o,q, 


Fs, 


%q, 


Fs, 


[o,q, 


Fs, 


;±i, 


Fs, 


;±i, 


Fs^i 


;±i. 


Fs2 


;±i, 


Fs^i 


;±i, 


Fs2 


;±i, 


Fs,{ 


;±i, 


Fs, 


;±i, 


Fs, 


;±i, 


Fs, 


;±i, 


Fs, 


;±i, 


Fs, 


;±i, 


Fs, 


;±i, 


Fss 


;±i, 


Fs,{ 


;±i, 



— q2 * 

q2 * 

q2 * 



2,1,2,6) 
2,2,2,5) 
2,3,2,4) 

1,2,1,±1,6) = 93* 
-l,2,l,±l,6) = 7i 
l,2,2,±l,5) = 7i 
-l,2,2,±l,5)=Hopf link 
l,2,3,±l,4) = 9n* 
-1,2,3,±1,4) = 52 
1,2,4,±1,3) = 7^ 
-1,2, 4, ±1,3) = 5? 
1,2, 5, ±1,2) = 98* 
-l,2,5,±l,2) = 4i 
l,2,6,±l,l) = 92o* 
-l,2,6,±l,l)=Hopf link 
0,2, 1,2, 6) = 72 



0,2,2,2,5) 
0,2,3,2,4) 



6itt3i 
4itt52 



Table 7. Example of a possible application to Theorem O Given recombination mediated by a 
tyrosine recombinase on the substrates C(2, 5) (MCN = 6) and C(2, 7) (MCN = 8) where exper- 
imental conditions eliminate distributive recombination, we use Table 6 to list all the possible 7 
and 9 noded products of this reaction. Only the products that are isotopic to a knot and link with 
MCN one more than the substrate are possible products of this reaction and we denote these with 
a star (*). 
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crossing chanj 





Figure 1. Twist knots are ubiquitous DNA knots. In the cell all DNA is supercoiled (like an over-used 
phone cord) so an unknot can be transformed to a twist knot by a single crossing change. 



^^ K^ ^y^ 



u. 



(A) 




Figure 2. All possible knots and links resulting from recombination on a twist knot must fall into one of 
these three families: (a) The family F(p,q,r,s,t,u) of knots and links. Most knotted and linked products 
are in this family, (b) The family Gl of knots and links, (c) The family G2 of knots and links. For K G Gl 
or G2, k odd =^ K is a, knot and k even =^ K is a, two component link. 







• • 



• • 



Figure 3. The line represents the axis of the double helix of the substrate DNA molecule. The recom- 
binase dimers (grey circles) bind at each of the two specific sites (filled and hollow arrows) and the sites 
are brought together forming the synaptic complex with crossover sites juxtaposed (second image from the 
left). After cleaving, exchanging and resealing the DNA, the proteins dissociate completing the reaction. 
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Cleave 

both 
duplexes 



IDI 



Switch 
strands 



(A) 




Figure 4. (a). Biological Assumption 3: Serine recombinase. Serine recombinases perform simultaneous 
double-stranded breaks, rotate one half of the recombinase complex relative to the other by 180° and rebind 
the DNA (b). Biological Assumption 3: Tyrosine recombinase. Tyrosine recombinases cleave one strand 
from each duplex, exchange the cleaved strands, and ligates them to form a Holliday junction (rightmost 
two panels). Isomerization of this junction alternates the catalytic activity and the same process happens 
with the other two DNA strands. These images are modifications of Figures 3 and 11 in ^33j . 





U^ 1 




OD-^0 



^ Path 3 

Form ii2 




© 



Figure 5. Mathematical Assumption 3: Serine recombinases. Begin with all possible projections of 
the pre-recombinant conformation of the recombinase complex, with zero or one crossings. Follow with 
projections of the post-recombinant conformations of the productive synapse at each round of processive 
recombination. 
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(c) Isotopy from C{-2,v - 1) to C{2,v) 



)( X X X 5^ XX XX 



-1 +2 -2 

(D) 



+2 



Figure 6. Background terminology, (a) The clasp knot C(r,v) with two nonadjacent rows of crossings, 
one with r / 0, 1 crossings and the other with -u / crossings, (b) The substrate we consider here and 
in |39], the twist knot C(2, v). Note r is now a hook of 2 crossings, (c) A continuous deformation taking the 
twist knot C{—2,v) to the twist knot C(+2, i; + 1). (d) Crossing sign convention used in this paper. 






JuB 



B is a productive synapse 



B is not a 
productive synapse 



Figure 7. Productive synapse. The thin black lines illustrate the central axis of the DNA molecule. We 
assume that the recombinase complex is a productive synapse. B (light grey circle) denotes the smallest 
convex region containing the four bound recombinase molecules (small grey discs) and the two crossover 
sites (highlighted in black). Left and middle: B is a productive synapse. Right: B is not a productive 
synapse. In this case we cannot draw B such that only the two crossover sites are inside it without also 
including the third (horizontal, thin) strand. 
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Uo^ 



(a) 






Figure 8. (a). A Montesinos knot or link has a projection as illustrated here. The Ri are rational 
tangles, (b) A rational tangle with alternating crossings, (c) The partial sum of two tangles, refers to the 
tangle diagram resulting from the insertion of two tangle diagrams into the shaded discs, (d) Numerator 
closure of a tangle. 



Subfamily 1 

F Si(0, q, r, s, t, u) 

V^ Cy O 

u- s • q ! 



Subfamily 2 

FS2(+l,q, r, s, ±l,u) 

^'-^ \A>\ 



q. 



Subfamily 3 

F S3(±l,q,r, s, t, u) 

u. s • 



; 



Subfamily 4 

FS4(p, q, r, s, t, u) 

Cl^fTT^ Jr[>l ^iFi^ 

^^ K^ O 



Subfamily 5 

T(2,q) # C(t,u) 




Subfamily 6 

C(t, u) # C(r, s) 



Cl 



Subfamily 7 
C(r, s) 





Subfamily 8 

T(2,r) 

^r^ K^ o^ 



Subfamily 9 

K(q, s, u) 



u . s 



z:^ 



Figure 9. Subfamilies of the family illustrated in Figure [2a] The nine subfamilies obtained by setting 
p, q,r,s, t, and/or u equal to or ±1 in the family of knots and links F(p, q,r,s, t, u). Top: product subfamily 
Fs-^(0,q,r, s,t,u) with |r|,|t| > 1, product subfamily ^5-2(^1, g, r, s, zbl, u) with \r\ > 1, product subfamily 
Fs^i^l, q,r, s,t, u) with \r\,\t\ > 1, product family ^^^(p, g, r, s, t, w) with |f|,|r|,|7)| > 1, product subfamily 
of composite knots T(2,u)'iC(p,q). Bottom: product subfamily F(— 1,1, r,s,t,u) with |t|,|r| > 1, product 
subfamily of clasp knots and links C(r, s), product subfamily of torus knots and links T(2,r), product 
subfamily of pretzel knots K{p, s,u). 



FormBl 



FormB2 



FormB3 



FormB4 







Figure 10. Assumption 1: Projections of the pre-recombinant productive synapse. Assumption 1 states 
that there is a projection of the pre-recombinant productive synapse with at most one crossing. Note 
that it does allow productive synapses like the hook, where there is a projection with one crossing but 
no projections with zero crossings. Assumption 3 for tyrosine recombinases: After recombination with a 
tyrosine recombinase, the productive synapse has a projections with at most one crossing. 
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Neighbourhood B 




Figure 11. By continuously deforming B H J within a neighbourhood of the hook, we can obtain a 
projection of the hook with exactly one crossing. This affects the projection of the rest of the substrate only 
by adding one positive crossing to the row of v crossings. 




(a) 




Figure 12. Obtain a planar surface with twists by replacing a neighborhood of each arc by a half-twisted 
band. Here our planar surface is a doubly punctured disc and our non-planar surface with twists is a surface 
whose boundary is the twist knot C(2,v). 





Figure 13. A surface D with boundary J is a spanning surface for J if D is topologically equivalent to 
a doubly-punctured planar disc with twists when J is a twist knot C(2,v). 
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% 



(i) Cnj is knotted 




(ii) C n J is unknotted 




Xu^ 



(iii) CnSB contains a circle (iv) Trivial piercing of a supercoil 

Figure 14. (i) A knot is trapped in the DNA branches outside of B. (ii) An unknotted substrate with 
the synaptic complex formed, (iii) The recombinase complex pierces a supercoil in a non-trivial way, DDdB 
contains at least one circle as well as two arcs, (iv) The productive synapse B trivially pierces through a 
supercoil and the circle contained in D D B can be removed via an isotopy of C. Scenarios (i) and (iv) are 
allowed by our assumptions, the other two scenarios are not. 





(a). Subfamily 3 

Fs3(±l,q,2,s,+n, u) for u+s=v 



(b). Subfamily 7 

Twist knots C(2,v) 





(e). Subfamily 7 
C(r+2,v) 

Unknots 

Torus knots T(2,v+1) 

Clasp knots C(r,v) 



(f). Subfamily 8 
T(2,v+n) 

Torus knots T(2,v+n) 





(c). Subfamily 5 
T(2,m)#C(2,v) 



Twist knots C(2,v) 

unknots 

Composite knots T(2,m)#C(2,v) 



(d). Subfamily 1 

Fsi(0,q,2,v,+n, 0) 





(g). Subfamily 7 
C(n,v) 

unknots 

Torus knots T(2,v+1) 

Clasp knots C(+n, v) 



(h). Subfamily 7 
C(2,v+n) 

Twist knots C(2, v+n) 



Figure 15. Products of recombination with serine recombinases. Theorem 2: All possible projections of 
the post-recombinant conformation of the recombinase-DNA complex J U B and the productive synapse B 
after n rounds of processive recombination with a serine recombinase. The images inside the circles denote 
forms nl and n2 of B after n rounds of processive recombination. 



FoniiBl 



Form B2 



Form B3 



Form B4 



Form B5 



FonnB6 



Form B7 



Form BS 











Figure 16. Mathematical Assumption 3: Tyrosine recombinases. Projections of the possible post- 
recombinant conformations of the recombinase complex. Hooks are allowed because they have projections 
with at most one crossing between the sites. 
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©ttO W^ W^ 



fW\} W^ ^f>rO WWO 



cr— '^— :::> 



cr— '^— =:^ 



c ^^^^ :> 



Subfamily 7 Subfamily 2 Subfamily 2 Subfamily 2 Subfamily 2 Subfamily 3 Subfamily 2 

C(2,v) FS2(± 1 ,q,2,s,± 1 ,u) FS2(± 1 ,q,2,s, 1 ,u) FS2(± 1 ,q,2,s,- 1 ,u) FS2(+ 1 ,q,2,s, ±1 ,u+2) FS3(± 1 ,q,2,s,2,u) FS2(± 1 ,q,2,s ±1 ,u-2) 



Subfamily 7 
C(2,v) 



Subfamily 3 
FS3(±l,q,2,s,-2,u^ 







Subfamily 7 
C(2,v) + O 



Subfamily 7 
C(2,v) 



Subfamily 7 
C(2,v/ 



Subfamily 5 
T(2,2) # C(2,v) 




Subfamily 7 
C(2,v) 



Subfamily 5 
T(2,-2) # C(2,v) 





Subfamily 8 Subfamily 7 

Hopf Link T(2,2) C(2,v) 




Subfamily 7 

C(2,v+1) 




Subfamily 7 

C(2,v-1) 




Subfamily 7 

C(2,v+2) 




Subfamily 1 

FSi(0,q,2,s,2,0) 




Subfamily 7 



amily 

C(2,v-2") 




Subfamily 7 
C(2,v) 




Subfamily 1 
FSi(0,q,2,s,-2,0) 



Subfamily 8 
umaioi 



Subfamily 7 
C(2,v) 



Subfamily ! 
T(2,v) 



Subfamily * 
T(2,v) 



Subfamily 8 
T(2,v+1) 



Subfamily < 

T<,2,1) 
unKTiof 



Subfamily I 
T(2,v-1) 



Subfamily 7 
C(3,v) 



Subfamily 1 

T(2,v+2) 



Subfamily 7 
C(-3,v) 



Subfamily 7 
C(-2,v) 



Subfamily 1 

T(: 
unK 



T(2,l) 
inKriox 



Subfamily ^ 

T(2,v-2) 

GlorG2 



Figure 17. Products of recombination with tyrosine recombinases. Theorem 1: Projections of all possible 
conformations of the post-recombinant recombinase-DNA complex JUB and the productive synapse B after 
a reaction with a twist knot substrate C(2, v),v ^ mediated by a tyrosine recombinase. 






C3 



C4 



Subfamily 7 
C(2,v) 



Subfamily 7 
C(4,v) 




C5 



B takes form B 1 B takes forms B 1 , B takes forms B2, B takes form B 1 B takes forms B3 , B4 
B3, B4 B3, B4 

Figure 18. Summary of Lemma 1. All the possible distinct forms that the substrate molecule C H J can 
take, up to isotopy, along with the corresponding forms of B for each form of C n J. 





Figure 19. A thrice-punctured S^ in S^, with arcs connecting the three punctures can be regarded as 
a graph with three points and a collection of edges connecting them. 
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Case (11) 




BnD is two discs 



BnD is a band 



BnD is two discs 



BnD is two discs 



BnD is two discs ^ 



BnD is two discs \ 




Case (23) 



Twist top loop and ^^^Y~\/s./~1 ( 
\ bring sites together v^^ \/ \ / 
' " :u :s :q 



Twist top loop and 
bring sites together 



W^rO 



BnD is two discs 





Twist bottom loop and^ 

)bring sites together iS^r~^0\~l 

:u :s :q 

C ^ — '^-^ 

ci 
Bring the sites 
together inside B - 1 1 j 

— *► not allowed 



BnD is two discs \. 



BnD is two discs 



^^^^ ^v Twist bottom loop and ^~y^ <^^ I j 

/ \ \ bring sites together \ / \ / \ / 

\^1 *- -u -s ;q 



Figure 20. Characterization of the recombinase-DNA complex. Specific sites situated either Case (11): 
both on the boundary of puncture 1 of S*^, Case (23): one on the boundary of puncture 2 of S'^ and the 
other on the boundary of puncture 3 of S*^. In cases where the right-most column says 'not allowed', we 
mean that we can not allow such a conformation because when bringing the two specific sites together inside 
B^ we get C n D non-planar, which is not allowed by assumption 2. 



Case (33) 



Case (22) 







BnD is two discs 



Bring the sites /WV 

together inside B f^i^ ( \/ 



Bring the sites 
together inside B 




;u is A 



Bring the sites 
together inside " 






Bring the sites CI 

together inside B 



:u :s :q^0 




Bring the sites 
together inside B 



;u Is ;q 



f'r\ \ Bring the sites C^^^^^^-T^ 
^/ \ ) together inside B ^^ A/S I J 



BnD is two discs 



(/®\ \ Twist top loop and 
*- v' B \r^^ bring sites together 

BnD is a band ^^X^ 7 




BnD is two discs^ 



^ ^ Twist both the top and 

j^\ \ bottom loops and bring,-— ~-^ 

. I^dK Vhe sites together^ ^^~XK~l J 

iu is ;q 



Continuously 
move B out of 
the clasp 



BnD is two discs 

Twist top loop 
and the middle 
B section between 

^j^ ^\ the two sets of ^ 
(/f\ \vertical ^ (/ 

) crossings, 



111 ;s :q=0 




troducing three 
crossings. 
Bring the sites 
together. 




Figure 21. Characterization of the recombinase-DNA complex. Specific sites situated either Case (22): 
both on the boundary of puncture 2 of S*^, Case (33): both on the boundary of puncture 3 of S*^. Note 
that cases (22) and (33) are equivalent, but we consider both here because it may be more straightforward 
to visualise the isotopy of C H J to one of the standard forms in one case or the other. 
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Case (13) 



Case (12) 



BnD is two discs 




2° ^^^fO 



lu ;s :q 
ci 
not allowed 



not allowed 



not allowed 



not allowed 



not allowed 




See isotopy j^-— ~-^ 

on figure 19 ^^>0<n 1 




-^ not allowed 



-^ not allowed 



not allowed 



not allowed 



Figure 22. Characterization of the recombinase-DNA complex. Specific sites situated either Case (12): 
one on the boundary of puncture 1 of S'^ and the other on the boundary of puncture 2 of S*^, Case (13): 
one on the boundary of puncture 1 of S'^ and the other on the boundary of puncture 3 oi S"^ . In cases where 
the right-most column says 'not allowed', we mean that we can not allow such a conformation because 
when bringing the two specific sites together inside B^ we get C D D non-planar, which is not allowed by 
assumption 2. Note that cases (12) and (13) are equivalent, but we consider both here because it may be 
more straightforward to visualise the isotopy oi C D J to one of the standard forms in one case or the other. 




Continuously 
deform grey 
strand as 
shown, so as 
two remove 
one crossing 
and introduce 
two new 
crossings 




rotation axis 



Rotate section 
inside dotted 
circle by 180° 
about the axis, 
s o as to remov e 
the two 
previous new 
crossings and 
introduce a 
new crossing 
between the 
two grey arcs. 



New crossing introduced by 
rotating section inside dotted 
circle by 180° 




Figure 23. isotopy for Case (lie): Assume both arcs of DCidB lie on boundary 1 of the thrice-punctured 
5^ C S^. The thrice-punctured S'^ generates the spanning surface D that has boundary illustrated on the 
second image from the left. Next, a continuous deformation taking this conformation of J to form CI. 
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v+1 




Twist top loop to 
change crossings 
of the clasp from 
+2 to -2 and to 
change position of 
the site from 
boundary 2 to 
b oundary 3. 




Continuously 
defomr grey 
strand downwards 
to place the two 
crossings in the 
middle next to 
each other. 




Place the sites next 
to each other 



Figure 24. Isotopy for Case (12a): Assume both arcs oiDr\dB lie on boundary 1 of the thrice-punctured 
S'^ C S^ . The thrice-punctured S'^ generates the spanning surface D that has boundary illustrated on the 
second image from the left. Next, a continuous deformation taking this conformation of J to form CI. 
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c^ 
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K(q, s, u) 



Figure 25. Proof of Theorem 4: Our family in Figure [2a] was broken down into these subfamilies to be 
able to compute Tables 2,3,4 and 5. 
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and 




Figure 26. Haya-Yamamoto: A projection of a knot or link is Hara-Yamamoto if when we cut off the 
row of p crossings on the left and reseal the strands in the two natural ways then both resulting projections 
are reduced alternating. 





Figure 27. Example of strand movement: By moving the two strands, we reduce from \r\ + \u\ + \q\ 
crossings originally, to|r| + |w| + |g|— 2 crossings in the alternating diagram. 
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